From Ordinary Differential Equations to Structural Causal Models: 

the deterministic case 
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Abstract 



We show how, and under which conditions, 
the equilibrium states of a first-order Ordi- 
nary Differential Equation (ODE) system can 
be described with a deterministic Structural 
Causal Model (SCM). Our exposition sheds 
more light on the concept of causality as ex- 
pressed within the framework of Structural 
Causal Models, especially for cyclic models. 



1 Introduction 

Over the last few decades, a comprehensive the ory for 
acycl i c causal mode l s was developed (e.g., see (|Pearl 
2000l : ISpirtes et all Il993l )l In particular, different, 
but related, approaches to causal inference and mod- 
eling have been proposed for the causally sufficient 
case. These approaches are based on different start- 
ing points. One approach starts from the (local or 
global) causal Markov condition and links observed in- 
dependences to the causal graph. Another approach 
uses causal Bayesian networks to link a particular fac- 
torization of the joint distribution of the variables to 
causal semantics. The third approach uses a structural 
causal model (sometimes also called structural equa- 
tion model or functional causal model) where each ef- 
fect is expressed as a function of its direct causes and 
an unobserved noise variable. The relati onships be- 
tween these aproa ches are well understood ( Lauritzenl 
199rilPear]ll2000h . 



Dominik Janzing 

Max Planck Institute 

for Intelligent Systems 

Tubingen, Germany 



Bernhard Scholkopf 

Max Planck Institute 

for Intelligent Systems 

Tubingen, Germany 



Over the years, several attempts have been made to 
extend the theory to the cyclic case, thereby enabling 
causal modeling of systems that involve feedback 
(ISpirted. ll995HKosterLll996l: IPearl and Dechterl .ll996t 



Neall . l200d; iHvttinen et all J2012I) . However, the rela- 
tionships between the different approaches mentioned 
before do not immediately generalize to the cyclic case 
in general (although partial results are known for the 
linear case and the discrete case). Nevertheless, several 



algorithms (starting from different assumptions) have 
been proposed for i nferring cycl i c cau s al models from 
observational data (IRichardson . 1996; Lacerda et al. 



2008 ; ISchmidt and Murphvl 120091: lltani et all [2010; 



Mooii et all 12011) 



The most straightforward extension to the cyclic case 
seems to be offered by the structural causal model 
framework. Indeed, the formalism stays intact when 
one simply drops the acyclicity constraint. However, 
the question then arises how to interpret cyclic struc- 
tural equations. One option is to assume an under- 
lying discrete-time dynamical system, in which the 
struct ural equations are used as fixed point equa- 
tions dSpirtesl. 119951: iDashl. 120051: lLacerda et all [2008; 



Mooii et all koill iHvttinen et all J2012I) . i.e.. they 
are used as update rules to calculate the values at 
time t + 1 from the values at time t, and then one 
lets t — > 00. Here we show how an alternative in- 
terpretation of structural causal models arises natu- 
rally when considering systems of ordinary differen- 
tial equations. By considering how these differential 
equations behave in an equilibrium state, we arrive at 
a structural causal model that is time independent, 
yet where the causal semantics pertaining to inter- 
ventions is still valid. As opposed to the usual inter- 
pretation as discrete-time fixed point equations, the 
continuous-time dynamics is not defined by the struc- 
tural equations. Instead, we describe how the struc- 
tural equations arise from the given dynamics. Thus 
it becomes evident that different dynamics can yield 
identical structural causal models. This interpretation 
sheds more light on the meaning of structural equa- 
tions, and does not make any substantial distinction 
between the cyclic and acyclic cases. 

It is sometimes argued that inferring causality 
amounts to simply inferring the time structure con- 
necting the observed variables, since the cause always 
preceeds the effect. This, however, ignores two im- 
portant facts: First, time order between two variables 
does not tell us whether the earlier one caused the later 



one, or whether both are due to a common cause. This 
paper addresses a second counter argument: a variable 
need not necessarily refer to a measurement performed 
at a certain time instance. Instead, a causal graph may 
formalize how intervening on some variables influences 
the equilibrium state of others. This describes a phe- 
nomenological level on which the original time struc- 
ture between variables gets lost, but causal graphs und 
structural equations may still be well-defined. On this 
level, also cyclic structural equations get a natural and 
well-defined meaning. 

For simplicity, we consider only deterministic systems, 
and leave the extension to stochastic systems with pos- 
sible confounding as future work. 

2 Ordinary Differential Equations 

Let X := {1, . . . , D} be an index set of variable labels. 
Consider variables Xi £ IZi for i £ I, where IZi C 
M di . We will use normal font for a single variable and 
boldface for a tuple of variables Xj £ Yiipi ^*- 



2.1 Observational system 

Consider a dynamical system T> described by D cou- 
pled first-order ordinary differential equations and an 
initial condition X £ IZz^j 

X t (t) = fi(X pav{i) ), X l (0) = (X ) l Vi£l (1) 

Here, pa^i) C X is the set of (indices of) parents^ 



of variable Xj, and each fi : TZ pav ^) — > TZi is a (suf- 
ficiently smooth) function. This dynamical system is 
assumed to describe the "natural" or "observational" 
state of the system, without any intervention from 
outside. We will assume that if j £ pa-p(i), then fi 
depends on Xj (in other words, fi should not be con- 
stant when varying Xj). Slightly abusing terminology, 
we will henceforth call such a dynamical system T> an 
Ordinary Differential Equation (ODE). 

The structure of these differential equations can be 
represented as a directed graph Qx> , with one node for 
each variable and a directed edge from Xi to Xj if and 
only if Xj depends on Xi . 

2.1.1 Example: the Lotka-Volterra model 



The Lotka-Volterra model (jMurravl 120021) is a well- 
known model from population biology, modeling the 
mutual influence of the abundance of prey X\ £ [0, oo) 
(e.g., rabbits) and the abundance of predators Xi £ 



1 We write X := 4§. 

2 Note that Xi can be a parent of itself. 





(a) & 



(b) 0d. 



do(x 2 =e 2 ) 



Figure 1: (a) Graph of the Lotka-Volterra model ((2|); 
(b) Graph of the same ODE after the intervention 
do(V2 = £2), corresponding with §5§. 



[0,oo) (e.g., wolves): 
= X 1 (6 11 - 

= — ^2(022 



'12^2) 
" #21 Vi) 



Xi(0) 
X 2 (0) 



(2) 



with all parameters 9ij > and initial condition satis- 
fying a > 0, b > 0. The graph of this system is depicted 



in Figure 1(a) 



2.2 Perfect interventions 

Interventions on the system T> described in (J]} can be 
modeled in different ways. Here we will focus on "per- 
fect" interventions: for a subset /Clof components, 
we force the value of Xj to attain some value £/ £ IZi . 
In particular, we will assume that the intervention is 
active from t = to t = 00, and that its value £/ does 
not change over time. In spired by the do-operator in- 
troduced by IPearll (|2000h , we will denote this type of 
intervention as do(Xj = £1). 

On the level of the ODE, there are many ways of re- 
alizing a given perfect intervention. One possible way 
is to add terms of the form n(£i — Xi) (with n > 0) to 
the expression for Xi, for all i £ I: 



m) - 



fi(X pav (i)) +K(£i-Xi) 



[fi{X pa , v (i)) 

Xi(0) = (X )i 



i£l 
i£l\I, (3) 



This would correspond to extending the system by 
components which monitor the values of {X{\i^i and 
exert negative feedback if they deviate from their tar- 
get values {£,i}i£i- Subsequently, we let k — > 00 to 
consider the idealized situation in which the interven- 
tion completely overrides the other mechanisms that 
normally determine the value of Xj. Assuming that 
the functions {fi}i£i are bounded, we can let n — > 00 
and obtain the intervened system £>do(x 7 =£j) : 



Xi{i) 



X l (0) 







i£l 



fi{X vayD (i)) i£l\I, 



Si 
(X ) 



i£l 

i£l\I 



(4) 



A perfect intervention changes the graph Qx> associ- 
ated to the ODE T> by removing the incoming arrows 
on the nodes corresponding to the intervened vari- 
ables {Xi} ie j. It also changes the parent sets of inter- 
vened variables: for each i G I, pa-p(i) is replaced by 
P a v do(Xl=il )(i)=®- 

2.2.1 Example: Lotka-Volterra model 



Let us return to the example in section 12.1.11 In this 
context, consider the perfect intervention do(X 2 = £2)- 
This intervention could be realized by monitoring the 
abundance of wolves very precisely and making sure 
that the number equals the target value £2 at all time 
(for example, by killing an excess of wolves and in- 
troducing new wolves from some reservoir of wolves). 
This leads to the following intervened ODE: 




2*2) 



Xi(0) =a 



(5) 



[Jf a (0) = 6 

The corresponding intervened graph is illustrated in 



Figure 1(b) 



2.3 Stability 

An important concept in our context is stability, de- 
fined as follows: 

Definition 1 The ODE T> specified in j7]) is called 
stable if there exists a unique equilibrium state X* € 
IZz such that for any initial state Xq G IZj, the system 
converges to this equilibrium state as t — > 00: 



3! 



x*eK x vx e7?,i : hm X(t) — X* 
t— ^00 



V, 



One can weaken the stability condition by demanding 
convergence to and uniqueness of the equilibrium only 
for a certain subset of all initial states. For clarity of 
exposition, we will use this strong stability condition. 

We can extend this concept of stability by considering 
a certain set of perfect interventions: 

Definition 2 Let J C V(1)E The ODE V specified 
in (Qp is called stable with respect to J if for all I G J 
and for all $,1 G IZi, the intervened ODE T>^ /x i —^a 



has a unique equilibrium state X% 



(Xi=Si 



G TZx such 



that for any initial state Xq G TZx with (-Xo)j = £/, 
the system converges to this equilibrium as t — > 00: 

3!x* r , T) eKx VxoGKxs.*. : 1™ X{t) = X* do{x ^ y 



3 For a set A, we denote with V(A) the power set of A 
(the set of all subsets of A) . 



This definition can also be weakened by not demand- 
ing stability for all £/ G TZi, but for smaller subsets 
instead. Again, we will use this strong condition for 
clarity of exposition, although in a concrete example to 
be discussed later (see Section l!2.3.2[) . we will actually 
weaken the stability assumption along these lines. 

2.3.1 Example: the Lotka-Volterra model 

The ODE ^ of the Lotka-V olterra model i s not sta- 
ble, as discussed in detail by iMurravl (|2002i ). Indeed, 
it has two equilibrium states, (X*,X|) = (0,0) and 
(Xf,X%) = (622/621,011/612). The Jacobian of the 
dynamics is given by: 



V/(X) 



'11 — 6x2X2 
621X2 



—6\iX\ 

?22 + 6ilX\ 



In the first equilibrium state, it has a positive and a 
negative eigenvalue (6\\ and — #22, respectively), and 
hence this equilibrium is unstable. At the second equi- 
librium state, the Jacobian simplifies to 







1/61 



'126*22/^21 




which has two imaginary eigenvalues, ±?v5ii^22- One 
can show (JMurravl . 120021 ) that the steady state of the 
system is an undamped oscillation around this equi- 
librium. 

The intervened system ([5]) is only generically stable, 
i.e., for most values of &'■ the unique stable equilib- 
rium state is (X*, X|) = (0, £2) as long as 611 — 612& ¥" 
0. If 6n — 612^2 = 0, there exists a family of equilibria 

(A7,X 2 *) = (c,6)withc>0. 

2.3.2 Example: damped harmonic oscillators 

The favorite toy example of physicists is a system 
of coupled harmonic oscillators. Consider a one- 
dimensional system of D point masses rm (i = 
1, . . . , D) with positions Qi G K and momenta Pi G K, 
coupled by springs with spring constants fc, and equi- 
librium lengths U, under influence of friction with fric- 
tion coefficients bi, with fixed end positions (see also 
Figure [2}. 

We first sketch the qualitative behavior: there is a 
unique equilibrium position where the sum of forces 
vanishes for every single mass. Moving one or several 



q = 



mi rri2 ms 771,4 



HP k fei k 2 k 3 fc 4 ; 



Figure 2: Mass-spring system for D = 4. 



masses out of their equilibrium position stimulates vi- 
brations of the entire system. Damped by friction, ev- 
ery mass converges to its unique and stable equilibrium 
position in the limit of infinite time. If one or several 
masses are fixed to positions different from their equi- 
librium points, the positions of the remaining masses 
still converge to unique (but different) equilibrium po- 
sitions. The structural equations that we derive later 
will describe the change of the unconstrained equilib- 
rium positions caused by fixing the others. 

The equations of motion for this system are given by: 



-Pi 



Pi = h{Qi+i - Qi - k) 

- h-i(Qi - Qi-i - k-i) - 
Qi = Pi /mi 
where we define Qo :— and Qd+i '■= L. The graph 



of this ODE is depicted in Figure 3(a) At equilibrium 
(for t — » oo), all momenta vanish, and the following 
equilibrium equations hold: 



= ki(Q i+ i 
0=R 



k) - h-i(Qi 



k-i) 



which is a linear system of equations in terms of the Qi. 
There are D equations for D unknowns Qi,..., Qd, 
and one can easily check that it has a unique solution. 

A perfect intervention on Qi corresponds to fixat- 
ing the position of the j'th mass. Physically, this is 
achieved by adding a force that drives Qi to some 
fixed location, i.e., the intervention on Qi is achieved 
through modifying the equation of motion for Pi. To 
deal with this example in our framework, we consider 
the pairs Xi := (Qi,Pi) € R 2 to be the elementary 
variables. Consider for example the perfect interven- 
tion do(A 2 = (£2,0)), which effectively replaces the 
dynamical equations Q2 and Pi by Qi — 0, P2 = 
and their initial conditions by {Qv)i = £2, (^0)2 = 0. 
The graph of the corresponding ODE is depicted in 
Figure |3(b)| Because of the friction, also this inter- 
vened system converges to a unique equilibrium that 
does not depend on the initial value. 

This holds more generally: for any perfect interven- 
tion on (any number) of pairs Xi of the type do(A^ = 
(£i, 0)), the intervened system will converge towards a 
unique equilibrium because of the damping term. In- 
terventions that result in a nonzero value for any mo- 
mentum Pi while the corresponding position is fixed 
are physically impossible, and hence will not be consid- 
ered. Concluding, we have seen that the mass-spring 
system is stable with respect to perfect interventions 
on any number of position variables, which we model 
mathematically as a joint intervention on the corre- 
sponding pairs of position and momentum variables. 




(a) Qi 




(b)& 



do(Q 2 =J 2 ,P 2 =0) 



Figure 3: Graphs of the dynamics of the mass-spring 
system for D = 4. (a) Observational situation (b) 
Intervention do(Q2 = £2, P2 = 0). 

3 Equilibrium equations 

In this section, we will study how the dynamical equa- 
tions give rise to equilibrium equations that describe 
equilibrium states, and how these change under perfect 
interventions. This is an intermediate representation 
on our way to structural causal models. 

3.1 Observational system 

At equilibrium, the rate of change of any variable is 
zero, by definition. Therefore, an equilibrium state of 
the observational system V defined in (p} satisfies the 
following equilibrium equations: 



= /iCXpa^i)) Vie I. 



(6) 



This is a set of D coupled equations with unknowns 
Xi,...,Xjj. The stability assumption (cf. Defini- 
tion [T]) implies that there exists a unique solution X* 
of the equilibrium equations (j6|). 

3.2 Intervened systems 

Similarly, for the intervened system £>do(Xi=£;) defined 
in (j4|), we obtain the following equilibrium equations: 



= x, - & 

= fjiXpavU)) 



Mi el 
Y? £l\I 



(7) 



If the system is stable with respect to this interven- 
tion (cf. Definition [2]) , then there exists a unique so- 
lution X% , x _> , of the intervened equilibrium equa- 
tions (0). 



Note that we can also go directly from the equilib- 
rium equations ((6]) of the observational system T> to 



the equilibrium equations (|7J| of the intervened system 
2?do(Xr=£r)j simply by replacing the equilibrium equa- 
tions "0 = fi(X pa u\)" for i £ I by equations of the 
form "0 = Xi — £j". Indeed, note that the modified 
dynamical equation 

Xj = fi(X pSiv (i)) + k(£ 4 - Xi) 

yields an equilibrium equation of the form 

o = Mx p ^ (l) ) + K & - x. t ) 

which, in the limit k — ¥ oo, reduces to = Xi — £j 
(assuming that fi is bounded). This seemingly trivial 
observation will turn out to be quite important. 

3.3 Labeling equilibrium equations 

If we would consider the equilibrium equations as a 
set of unlabeled equations {Si : i £ I}, where Si de- 
notes the equilibrium equation "0 = /j(X pai) (j))" (or 
"0 = Xi — £j" after an intervention) for i £ X, then we 
will not be able to correctly predict the result of inter- 
ventions, as we do not know which of the equilibrium 
equations should be changed in order to model the 
particular intervention. This information is present in 
the dynamical system T> (indeed, the terms "Xj" in 
the l.h.s. of the dynamical equations in ([1]) indicate 
the targets of the intervention) , but is lost when con- 
sidering the corresponding equilibrium equations © 
as an unlabeled set (because the terms "Xj" are all 
replaced by zeroes). 

This important information can be preserved by la- 
beling the equilibrium equations. Indeed, the labeled 
set of equilibrium equations S := {(i,Si) : i £ 1} 
contains all information needed to predict how equi- 
librium states change on arbitrary (perfect) interven- 
tions. Under an intervention do(X/ = £j), the equi- 
librium equations are changed as follows: for each in- 
tervened component i £ I, the equilibrium equation Si 
is replaced by the equation Si defined as "0 = Xi — £j" , 
whereas the other equilibrium equations Sj for j £ X\I 
do not change. Assuming that the dynamical system 
is stable with respect to this intervention, this modi- 
fied system of equilibrium equations describes the new 
equilibrium obtained under the intervention. We con- 
clude that the information about the values of equilib- 
rium states and how these change under perfect inter- 
ventions is encoded in the labeled equilibrium equa- 
tions. 

3.4 Labeled equilibrium equations 

The previous considerations motivate the following 
formal definition of a system of Labeled Equilibrium 
Equations (LEE) and their semantics under interven- 
tions. 



Definition 3 A system of Labeled Equilibrium 
Equations (LEE) S for D variables {Xi}i £ z with I :— 
{!,..., D} consists of D labeled equations of the form 



Si : = ffi(Xpa (i)), i £ X, 



(8) 



where pa £ (i) C X is the set of (indices of) parents of 
variable Xi, and each gi : TZ pa , (i) — > TZi is a function. 

The structure of an LEE S can be represented as a 
directed graph Qg, with one node for each variable 
and a directed edge from Xi to Xj (with j ^ i) if and 
only if Si depends on Xj . 

A perfect intervention transforms an LEE into another 
(intervened) LEE: 

Definition 4 Let I CX and £i £ IZi . For the perfect 
intervention do(X/ = £/) that forces the variables Xi 
to take the value £j, the intervened LEE £do(x 7 =£j) * s 
obtained by replacing the labeled equations of the origi- 
nal LEE S by the following modified labeled equations: 



Xj — £j 



i£l 



3i(X P a £ (i)) i£l\L. 



(9) 



We define the concept of solvability for LEEs that mir- 
rors the definition of stability for ODEs: 

Definition 5 An LEE S is called solvable if there ex- 
ists a unique solution X* to the system of (labeled) 
equations {Si}. An LEE S is called solvable with re- 
spect to J C V(X) if for all I £ J and for all £/ £ Tij, 
the intervened LEE £do(Xi=£i) * s solvable. 

As we saw in the previous section, an ODE induces an 
LEE in a straightforward way. The graph Q^ of the 
induced LEE Sx> is equal to the graph Qj> of the ODE 
T>. It is immediate that if the ODE V is stable, then 
the induced LEE St> is solvable. As we saw at the end 
of Section 13. 2\ our ways of modeling interventions on 
ODEs and on LEEs are compatible. We will spell out 
this important result in detail. 

Theorem 1 Let V be an ODE, I C X and £j € Hi. 

(i) Applying the perfect intervention do(X/ = £j) to 
the induced LEE Sx> gives the same result as construct- 
ing the LEE corresponding to the intervened ODE 



V. 



do(Xr=£j) 



(£v) 



do(Xj=£r) 



St 



(ii) Stability of the ODE T> with respect to the interven 
tion do(X/ = £/) implies solvability of the induced in- 
tervened LEE Stj 
librium and solution X 



, x , ,, and the corresponding equi- 



L do(Xr=£/ 



are identical. 



□ 



3.5 Example: damped harmonic oscillators 

Consider again the example of the damped, coupled 
harmonic oscillators of section [2.3.2l The labeled equi- 
librium equations are given explicitly by: 



= ki(Qi+i ~ Qi - k) 

— ki-i(Qi — Qi-i — li-i) 
=P, 



(10) 



Definition 7 Let I C.I and £/ € IZi . For the perfect 
intervention do(X/ = £j) that forces the variables Xj 
to take the value £/, the intervened SCM .Mdo(Xj=£j) 
is obtained by replacing the structural equations of the 
original SCM A\ by the following modified structural 
equations: 



X,= 



6 iel 



(12) 



4 Structural Causal Models 

In this section we will show how an LEE representation 
can be mapped to the more popular representation of 
Structural Causal Models , also known as Structural 
Equation M odels dBollenl Il989h - We follow the ter- 
minology of IPearll ( 20001 ) . but consider here only the 
subclass of deterministic SCMs. 

4.1 Observational 

The following definiti on is a spec ial case of the more 
general definition in ( Pearl l200Ct Section 1.4.1): 



Definition 6 A deterministic Structural Causal 
Model (SCM) M on D variables {Xi} ieX with 
T := {1, . . . , D} consists of D structural equations of 
the form 



Xi — hi(X pa 



i(i)J 



i£l, 



(11) 



where pa, M (i) (=2\ {i} is the set of (indices of) par 

ents of variable Xi, and each h 

function. 



T^pa M (i) -*• ftt is a 



Each structural equation contains a function hi that 
depends on the components of X in p& M (i). We think 
of the parents pa_ A/) («) as the direct causes of Xi (rel- 
ative to Xx) and the function hi as the causal mech- 
anism that maps the direct causes to the effect Xi. 
Note that the l.h.s. of a structural equation by defini- 
tion contains only Xi, and that the r.h.s. is a function 
of variables excluding Xi itself. In other words, Xi is 
not considered to be a direct cause of itself. The struc- 
ture of an SCM Ai is often represented as a directed 
graph Qmi with one node for each variable and a di- 
rected edge from Xi to Xj (with j ^ i) if and only if hi 
depends on Xj . Note that this graph does not contain 
"self- loops" (edges pointing from a node to itself), by 
definition. 

4.2 Interventions 



A Structural Causal Model M. comes with a s pecific 
sema ntics for modeling perfect interventions ([Pearl 



The reason that the equations in a SCM are called 
"structural equations" (instead of simply "equations" ) 
is that they also contain information for modeling in- 
terventions, in a similar way as the labeled equilibrium 
equations contain this information. In particular, the 
l.h.s. of the structural equations indicate the targets 
of an interventionQ 

4.3 Solvability 

Similarly to our definition for LEEs, we define: 

Definition 8 An SCM Ai is called solvable if there 
exists a unique solution X* to the system of structural 
equations. An SCM Ai is called solvable with respect 
to J C V{1) if for all I G J and for all £j e Hi, the 
intervened SCM ^ / t ( jo(JC/=$/) * s solvable. 

Note that each (deterministic) SCM Ai with acyclic 
graph Gm is solvable, even with respect to the set of 
all possible intervention targets, V(T). This is not 
necessarily true if directed cycles are present. 

4.4 From labeled equilibrium equations to 
deterministic SCMs 

Finally, we will now show that under certain stability 
assumptions on an ODE 2?, we can represent the in- 
formation about (intervened) equilibrium states that 
is contained in the corresponding set of labeled equi- 
librium equations £p as an SCM Ale?,- 

First, given an LEE £ , we will construct an induced 
SCM Ate, provided certain solvability conditions hold: 



Definition 9 If for each i El, the LEE £ is solvable 
with respect to some li C I with pa e (i) \ {i} C 7, C 
X\ {i}, then it is called structurally solvable. 



2000): 



4 In Pearl (2000)'s words: "Mathematically, the distinc- 
tion between structural and algebraic equations is that the 
latter are characterized by the set of solutions to the entire 
system of equations, whereas the former are characterized 
by the solutions of each individual equation. The implica- 
tion is that any subset of structural equations is, in itself, 
a valid model of reality — one that prevails under some set 
of interventions." 



If the LEE £ is structurally solvable, we can proceed 
as follows. Let i £ 1. We define the induced parent set 
P a A4£ (*) := P a f(*) \ {*}• Assuming structural solvabil- 
ity of £, under the perfect intervention do(X^ = £r 4 ), 
there is a unique solution X^ , x =£ 1 to ^ Qe inter- 
vened LEE, for any value of £j 4 € 7?.^ . This defines a 

IZi given by the z'th compo- 



function hi : 1Z 



paA4„(i) 



The i'th struc- 



nent Mlpa^w) := {Xdo(x h =£ h )) 

tural equation of the induced SCM Me is defined as: 

Xi = hi(X p3iM£ {i)). 

Note that this equation is equivalent to the labeled 
equation Ei in the sense that they have identical solu- 
tion sets { (X% , X* u))}- Repeating this procedure 
for all i £ I, we obtain the induced SCM Me- 

This construction is designed to preserve the impor- 
tant mathematical structure. In particular: 

Lemma 1 Let £ be an LEE, LCI and £/ £ IZi 

and consider the perfect intervention do(Xj = £/). 
Suppose that both the LEE £ and the intervened LEE 
£do(x z=£j) are structurally solvable, (i) Applying the 
intervention do(Xi = £/) to the induced SCM Me 
gives the same result as constructing the SCM corre- 
sponding to the intervened LEE £do(x I =£ I ) : 



(M £ )do(X I =£ I ) 



M £d 



(ii) Solvability of the LEE £ with respect to the in- 
tervention do(Xi = £i) implies solvability of the 
induced SCM Me with respect to the same inter- 
vention &o(Xi — £f), and their respective solutions 



X 



do(Xj=£j) 



are identical. 



Proof. The first statement directly follows from the 
construction of the induced SCM. The key observa- 
tion regarding solvability is the following. From the 
construction above it directly follows that 



V 



x 



pa £ («) 
= 



ev. 



pa £ (0 



9i{X pa , e (i)) «= 
This trivially implies that 

VxeKx : = 9i{X pa£ (i)) 



Xi — hiiX 



pa 



ed)\{i})- 



x. L = hi(X. 



P a M *(*)) 



This implies that each simultaneous solution of all la- 
beled equations is a simultaneous solution of all struc- 
tural equations, and vice versa: 



V 



xeTZi 



(jVi e z : =5i(X pag( j))] 

^=^ [V i£ i:Xi = /ii(^Tpa A4e («))] 



The crucial point is that this still holds if an interven- 
tion replaces some of the equations (by = Xi — & 
and Xi = £i, respectively, for alii £ I). □ 



4.5 From ODEs to deterministic SCMs 

We can now combine all the results and definitions so 
far to construct a deterministic SCM from an ODE 
under certain stability conditions. We define: 

Definition 10 An ODE T> is called structurally sta- 
ble if for each i £ I, the ODET> is stable with respect 
to some Li CI with pa-p(i) \ {i} C/jCI\ {i}. 

Consider the diagram in Figure |U Here, the labels of 
the arrows correspond with the numbers of the sec- 
tions that discuss the corresponding mapping. The 
downward mappings correspond with a particular in- 
tervention do(Xi = £/), applied at the different levels 
(ODE, induced LEE, induced SCM). Our main result: 



Theorem 2 If both the ODE T> and the intervened 
ODET> do /x I= ^ I ) are structurally stable, then: (i) The 
diagram in Figure [J] commutes^ (ii) If furthermore, 
the ODE T> is stable with respect to the intervention 
do(Xi — £i), the SCM Ms-r, has a unique 

solution that coincides with the stable equilibrium of 
the intervened ODE X><wx /= £ 7 ). 

Proof. Immediate from Theorem [T] and Lemma [TJ □ 



Note that even though the ODE may contain self- 
loops (i.e., the time derivative Xi could depend on 
Xi itself, and hence i £ pap(i)), the induced SCM 
Ms-o does not contain self-loops by construction (i.e., 
i $ P&Me (*))• Somewhat surprisingly, the structural 
stability conditions actually imply the existence of self- 
loops (because if Xi would not occur in the equilibrium 
equation (£v)i, its value would be undetermined and 
hence the equilibrium would not be unique) . 

Whether one prefers the SCM representation over the 
LEE representation is mainly a matter of practical con- 
siderations: both representations contain all the neces- 
sary information to predict the results of arbitrary per- 
fect interventions, and one can easily go from the LEE 
representation to the SCM representation. One can 
also easily go in the opposite direction, but this can- 
not be done in a unique way. For example, one could 
rewrite each structural equation Xi = hi{X p& (r\) as 
the equilibrium equation = hi(X pa JJVi (i)) — Xi, but 
also as the equilibrium equation = hf (X pa u\ ) — Xf 
(in both cases, it would be given the label i). 



In case the dynamics contains no directed cycles (not 
considering self- loops), the advantage of the SCM rep- 
resentation is that it is more explicit. Starting at 



J This means that it does not matter in which direction 
one follows the arrows, the end result will be the same. 



ODE 

V 



intervened ODE 



V, 



do(Xj=£r) 



LEE 
Gv 



SCM 



intervened LEE 



intervened SCM 



Figure 4: Each of the arrows in the diagram corresponds with a mapping that is described in the section that 
the label refers to. The dashed arrows are only defined under structural solvability assumptions on the LEE. If 
the ODE T> and intervened ODE 2?do(Xj=£j) are structurally stable, this diagram commutes (cf. Theorem^. 



the variables without parents, and following the topo- 
logical ordering of the corresponding directed acyclic 
graph, we directly obtain the solution of an SCM by 
simple substitution in a finite number of steps. On the 
other hand, the LEE representation is more implicit, 
and we need to solve a set of equations. In the cyclic 
case, one needs to solve a set of equations in both rep- 
resentations, and the difference is merely cosmetical. 
However, one could argue that the LEE representation 
is slightly more natural in the cyclic case, as it does 
not force us to make additional (structural) stability 
assumptions. 

4.6 Example: damped harmonic oscillators 

Figure [5] shows the graph of the structural causal 
model induced by our construction. It reflects the 
intuition that at equilibrium, (the position of) each 
mass has a direct causal influence on (the positions 
of) its neighbors. Observing that the momentum vari- 
ables always vanish at equilibrium (even for any per- 
fect intervention that we consider) , we can decide that 
the only relevant variables for the SCM are the posi- 
tion variables Qi. Then, we end up with the following 
structural equations: 



Qi 



h{Qi+i -li) + h-i{Qi-i +k-i) 



H+l 



(13) 



5 Discussion 



In many empirical sciences (physics, chemistry, biol- 
ogy, etc.) and in engineering, differential equations 
are a commonly used modeling tool. When estimating 
system characteristics from data, they are especially 



Xi 



X 



Figure 5: Graph of the structural causal model in- 
duced by the mass-spring system for D = 4. 



useful if measurements can be done on the relevant 
time scale. If equilibration time scales become too 
small with respect to the temporal resolution of mea- 
surements, however, the more natural representation 
may be in terms of structural causal models. The main 
contribution of this work is to build an explicit bridge 
from the world of differential equations to the world 
of causal models. Our hope is that this may aid in 
broadening the impact of causal modeling. 

Note that information is lost when going from a dy- 
namical system representation to an equilibrium repre- 
sentation (either LEE or SCM), in particular the rate 
of convergence toward equilibrium. If time-series data 
is available, the most natural representation may be 
the dynamical system representation. If only snapshot 
data or equilibrium data is available, the dynamical 
system representation can be considered to be overly 
complicated, and one may use the LEE or SCM rep- 
resentation instead. 

We have shown one particular way in which structural 
causal models can be "derived" . We do not claim that 
this is the only way: on the contrary, SCMs can prob- 
ably be obtained in several other ways and from other 
representations as well. One issue that we have not 
yet addressed is that of constants of motion. For ex- 
ample, if we would not fix the end points of the chain 
of harmonic oscillators, then the total momentum of 
the system would depend on the initial condition, and 
therefore the dynamics would not be stable anymore 
according to the definition we have used here. We 
believe that these and similar issues can probably be 
solved by being more explicit about which variables in 
the dynamics will become part of the structural causal 
model. We plan to address this in future work. 

We also intend to extend the basic framework de- 
scribed here towards the more general stochastic case. 
Uncertainty or "noise" can enter in two different ways: 
via uncertainty about certain (constant) parameters 
of the differential equations, and via latent variables. 
A complicating factor that has to be addressed then 



(which does not play a role in the deterministic case) 
is confounding. 
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